Europaisches 
Patenfcamt 



oooo os- us 



European 
Patent Office 



Office europeen 
des brevets 



o 



V£>' 

u 



Bescheinigung Certificate 



Attestation 



Die angehefteten Unterla- 
gen stimmen mit der 
ursprunglich eingereichten 
Fassung der auf dem nach- 
sten Blatt bezeichneten 
europaischen Patentanmel- 
dung uberein. 



The attached documents Les documents fixes a 
are exact copies of the cette attestation sont 
European patent application conformes a la version 
described on the following initialement deposee de 
page, as originally filed. fa demande de brevet 

europeen specifiee a la 
page suivante. 



Patentanmeldung Nr. Patent application No. Demande de brevet n° 

00202684. 7 



Der President des Europaischen Patentamts; 
Im Auftrag 

For the President of the European Patent Office 

Le President de TOffice europeen des brevets 
P.O. 




I.L.C. HATTEN-HECKMAN 



DEN HAAG,DEN 
THE HAGUE, 
LA HAYE,LE 



12/02/01 



EPA/EPO/OEB Form 1014 - 02.91 



Mi' *» 



US PAGE BLANK (uspto) 

t 




Europaisches 
Patentamt • 



European 
Patent Office 



Office europeen 
des brevets 



Blatt 2 der Bescheinigung 
Sheet 2 of the certificate 
Page 2 de I attestation 



Anmeldung Nr.: 
Application no.: 
Demande n": 



00202684.7 



Anmeldetag; 



Date of filing: 28/07/00 



Date de depot: 



Anmelder 

Applicant(s): 

Demandeur(s): 

Kon1nkl1jke Philips Electronics N.V. 

5621 BA Eindhoven 

NETHERLANDS 



Bezeichnung der Erftndung: 
Title of the invention: 
Titre de I'invention: 

System for controlling an apparatus with speech commands 



In Anspruch genommene Prioriat(en) / Priority(ies) claimed / Priorite(s) revendiquee(s) 

Staat: Tag: Aktenzeichen: 

State: Date: File no. 

Pays: Date: Numero de depot: 



Internationale Patent kiassifikat ion: 
International Patent classification: 
Classification internationale des brevets: 



Am Anmeldetag benannte Vertragstaaten: 

Contracting states designated at date of filing: AT/BE/CH/CY/DE/DK/ES/FI/FR/GB/GR/IE/IT/LI/LU/MC/NL/PT/SE^^ 
Etats contract ants designes lors du depot: 

Bemerkungen: 

Remarks: 

Remarques: 



EPA/EPO/OEB Form 1012 * 11.00 



/ 



• 



THIS PAGE BLANK (WPio) 




28-07-2000 j 9e " : ao ' «=3 BJ w ^EP00202684?j > epo/epa/oe^sw^k. Raoi JSPEC 

— 9:38 PHILIPS CIP NL +31 40 2743489 NO. 579 P 7 

PHTW000005EPP 

1 27,07.2000 
System for controlling an apparatus with speech commands 



FIELD OF THE INVENTION 

The invention relates to a system of an apparatus and a remote control for 
controlling said apparatus, the system comprising a speech processor for processing speech 
commands s the remote control comprising a microphone for enabling a user of said remote 
5 control to input said speech commands. The invention further relates to a remote control and 
an apparatus for use in the above system. The invention further relates to a method of 
controlling an apparatus, comprising a step of processing speech commands for controlling 
said apparatus. 

10 BACKGROUND OF THE INVENTION 

Recent developments in speech recognition techniques have enabled users of 
electronic systems to control said systems by means of spoken commands. Often, such 
systems are applied in a multi-user setting, e.g. a TV system is used by different members of 
a household. One of the problems with such systems is that two or more users may be 

1 5 speaking to the system at the same time. Since the users can speak the commands at any time 
and location, it is very difficult to resolve the input conflict. Even if the speakers utter their 
commands at different times, contradictory commands from different users still cause a 
problem. Obviously, discrimination of speech input is a serious concern for voice control 
with multiple users. 

20 US-A-5,777,571 discloses a solution to this problem. Users are registered as 

authorized users, and their speech commands are distinguished from speech commands from 
other users by means of voice identification techniques. A problem of this solution is that 
voice identification is an expensive and still unreliable technique, and new or occasional 
users need to be introduced to the system for authorization. Such an introduction often 

25 involves a lengthy training phase. Furthermore, this solution cannot resolve the problem that 
many speakers speak at the same time. 
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OBJECT AND SUMMARY OF THE INVENTION 

It is an object of the invention to provide an improved system and method of 
the type defined in the opening paragraph. To that end, the invention provides a system 
wherein the system also comprises a further microphone for enabling further users of the 
5 system to input speech commands The system according to the invention thus provides (at 
least) two microphones for controlling the apparatus. One of said remote controls is located 
on the remote control and is arranged to pick up speech commands uttered by the user of the 
remote control. The other microphone is located elsewhere, e.g. on the apparatus or at a 
central place in the room, and is arranged to pick up speech commands uttered by other users 

10 of the system whfch are not currently operating the remote control. In this way it is achieved 
that the system can distinguish speech commands from the user operating the remote control 
on the one hand, and other users on the other hand Hence, the speech commands uttered by 
the user who is operating the remote control are guaranteed to be optimally recognized, since 
the microphone is located relatively close to the user, and may have suitable characteristics to 

1 S pick-up sounds from the appropriate direction only. When multiple users are speaking 

simultaneously, the system may give priority to signals received from the microphone on the 
remote control, so that at least commands uttered by the user of the remote control will be 
recognized and processed. 

An embodiment of the system according to the invention is characterized by 

20 said further microphone being an omnidirectional microphone. In this way it is achieved that 
the position of the other users in the room is not critical. By contrast, the microphone on the 
remote control is preferably unidirectional and oriented such that it aims at its user's mouth 
when held in a normal manner. 

An embodiment of the system according to the invention is characterized by 

25 said system comprising input designation means for user operably designating said 

microphone and/or said further microphone as a signal source to said speech processor. In 
this way it is achieved that the system enables the speech commands obtained from the 
microphone and the further microphone to be selectively transmitted to the speech processor. 
For example* the speech processor may be controlled to process speech input from the 

30 microphone on the remote control only. Alternatively, the speech processor may be 

controlled to process speech input from the further microphone only, or from both the further 
microphone and the microphone on the remote control. Finally, the speech processor may be 
decoupled from both microphones, thus disabling the speech command facility. Preferably, 
the input designation means can be controlled via the remote control. For example, the 



untvangen: 2e/ 7/0O 9j39; 

fMM^^^M- ' 2000 9 : 39 PHIL 

PHTW000005EPP 




EPO/EPA/OE 1 

89 



JswijK; Pagm 
NO. 579 



. ti , . 3 27.07.2000 

remote control may comprise a button, enabling the user to switch between three different 
designations, e.g. respectively accept no speech processing at all, accept speech commands 
from the microphone on the remote control only, and accept speech commands from the 
farther microphone only. Alternatively, speech commands obtained from the further 
S microphone may be accepted by default, while pressing a button on the remote control causes 
the microphone on the remote control to be temporarily designated as the input source of the 
speech processor. 

BRIEF DESCRIPTION OF THE DRAWINGS 
10 These and other aspects of the invention are apparent from and will be 

elucidated, by way of a non-limitative example, with reference to the embodiments) 

described hereinafter. In the drawings, 

Figure 1 shows a television receiver and a remote control as an embodiment of 

a system according to the invention, 
1 5 Figure 2 shows a diagram of a television receiver as an embodiment of an 

apparatus according to the invention. 



DESCRIPTION OF EMBODIMENTS 

Figure 1 shows a television receiver 101 and a remote control 102 as an 

20 embodiment of a system according to the invention* The remote control 102 comprises an 
infrared (IR) transmitter 103, a microphone 104 and control elements including, inter alia, a 
button 105. The television receiver 101 comprises an IR receiver 106 and a further 
microphone 1 07, The television receiver can be operated by means of the remote control 1 02. 
For that purpose, the remote control 102 comprises a plurality of keys for controlling various 

25 functions of the television receiver 101. For example, the remote control 1 02 may comprise 
numerical keys, zap keys etc. which are well known in the art. Additionally, a user of the 
remote control 102 can enter speech commands via the microphone 104, which are then 
transmitted to the IR receiver 106 of the television receiver 101 and converted by a speech 
processor, described hereinafter, to corresponding control commands. The further 

30 microphone 107 is an omnidirectional microphone which picks up speech signals from any 
direction, thus enabling other users which are not currently holding the remote control 102 to 
control the television receiver 101 by means of voice commands. The button 105 controls 
input designation means, described hereinafter, for designating either or both microphone 
104 and further microphone 107 as a signal source for the speech processor. 
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Figure 2 shows diagram of a television receiver as an embodiment of an 
apparatus according to the invention. For consistency and ease of understanding, the same 
reference numerals as in Fig.l are used for items having functions similar to those presented 
in Figure 1 . TV signals are received from the ether by an antenna 201 or, alternatively, from a 
5 cable network. One of the TV signals is selected by a tuner 202, decoded and split into an 
audio signal, a video signal and a data signal. The audio signal is further processed by an 
audio processor 203 and a loudspeaker 204. The video signal is further processed by a video 
processor 205 and presented on a screen 206. The data signal is transmitted to a central 
processing unit (hereinafter "CPU") 208, which comprises one or more microprocessors 

1 0 capable of executing program instructions. These program instructions comprise parts of 
software modules including, inter alia, a command interpreter 209, and a speech processor 
211. The CPU 208 is capable of controlling functions of the TV-set and transmitting data to 
the video processor 205 to be presented on the screen 206, The command interpreter 209 
receives user commands from the IR receiver 106, which in turn receives IR signals from the 

1 5 remote control 1 02, and transmits them to the CPU 208 to be processed. For example, when 
the user enters a channel number the CPU 208 controls the tuner 202 to select the 
corresponding channel, and sends data to the video processor 205 to present feedback on the 
screen 206, e.g. the preset number, the channel name and the program category being 
displayed for a few seconds. When the user issues a zap-command, e.g. by pressing either of 

20 the up/down keys of the remote control, the same feedback is presented and the tuner 202 is 
controlled to select a channel which is arranged after or before the current channel. 

The CPU 208 is further capable of controlling a switch 215 which constitutes 
input designation means for the speech processor 211. The switch 215 can adopt three states. 
A first state of switch 21 5 designates the microphone 104 of the remote control 102 as the 

25 signal source foe the speech processor 211. Speech signals from the microphone 1 04 are 

converted into IR signals, received by the IR receiver 106 and transmitted to the switch 215. 
In the first sate of switch 215 the speech signals thus obtained from the microphone 104 are 
input to the speech processor 21 1 and converted to control commands which are then 
transmitted to the command interpreter 209. 

30 A second state of switch 215 designates the microphone 107 as the signal 

source for the speech processor 21 1. The microphone 107 is a part of the television receiver 
101 and signals obtained from the microphone 107 arc directly transmitted to a second 
contact of the switch 215. 
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A third state of switch 21 5 designates neither microphone 104 nor microphone 1 07 as the 
signal source for the speech processor 211, thus disabling speech input completely. 

The switch 21 5 is controlled by the CPU 208 in response to control signals 
received from the JR receiver 1 06, which control signals are in turn generated in response to 
5 the user of the remote control 102 operating the button 105. After system initialization the 
switch 215 adopts the first state as described above and depicted in Fig. 2. If the user presses 
the button 105, the switch 215 adopts the second state, thus designating the microphone 107 
as the signal source to the speech processor 21 1 and disabling the microphone 104. If the user 
presses the button 105 a second time, the switch 215 adopts the third state, thus designating 
1 0 neither microphone 1 04 nor microphone 1 07 . Pressing the button 1 05 again restores the first 
state. 

In an alternative embodiment the switch 21 5 can adopt a fourth state, wherein 
signals from both microphones are accepted However, if signals are received from both 
microphones simultaneously, signals received from the microphone 107 are disregarded in 

1 5 favor of the signals received from the microphone 1 04. 

In summary, the invention relates to a system comprising a speech processor 
for controlling an apparatus (101) with speech commands. The system according to the 
invention comprises a remote control (102) having a microphone (104) for entering speech 
commands. The system also comprises a further microphone (107) for enabling other users of 

20 the system to enter speech commands too. The system may comprise input designation means 
(105) for user operably designating said microphone (104) and/or said further microphone 
(170) as a signal source to the speech processor. 

Although the invention has been described with reference to particular 
illustrative embodiments, variants and modifications are possible within the scope of the 

25 inventive concept Thus, for example, the speech commands may be transmitted from the 
remote control by means of a radio frequency (RF) signals instead of an IR signals. 
Furthermore, a speech processor may also be comprised by the remote control instead of or in 
addition to the controlled apparatus. The input designation means may be controlled by 
means of a control element on the remote control or on the controlled apparatus. The control 

30 element may be a single state-toggle button as described above, or any other appropriate 
control element, such as a 'radio button' for each state, or a multi-position switch, each 
position of which corresponds to a particular state. The input designation means could be a 
switch as described hereinbefore, or a more sophisticated switching circuit under control of 
the speech processor itself. This would enable the input designation means to be controlled 
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by means of dedicated speech commands irrespective of the currently selected state. The 
speech processor would then first try to detect such dedicated speech commands irrespective 
of whether it is allowed to accept speech commands from the respective microphone, and 
subsequently adjust the input designation in accordance with the dedicated speech 
commands. Preferably, only a user of the remote control would be allowed to control the 
input designation means in such manner. 

More than two microphones may be applied, which may be located at various 
positions within the system, e.g. on the apparatus and the remote control as already described, 
or in the vicinity of the controlled apparatus. The system may also comprise multiple remote 
controls, each comprising a microphone. Preferably, one of the remote controls serves as a 
master control and is the only remote control capable of controlling the input designation 
means. 

The word 'comprising 1 does not exclude the presence of elements or steps 
other than those listed in a claim. In the claims, any reference signs placed between 
parentheses shall not be construed as limiting the claim. The invention can be implemented 
by means of hardware comprising several distinct elements, and by means of a suitably 
programmed computer. In the device claim enumerating several means, several of these 
means can be embodied by one and the same item of hardware. 
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CLAIMS: 



1, A system of an apparatus (101) and a remote control (102) for controlling said 
apparatus, the system comprising a speech processor (211) for processing speech commands, 
the remote control comprising a microphone (104) for enabling a user of said remote control 
to input said speech commands* wherein the system also comprises a further microphone 

5 (107) for enabling further users of the system to input speech commands. 

2. A system as claimed in claim 1, said further microphone (107) being an 
omnidirectional microphone. 

10 3. A system as claimed in claim 1 or 2, said further microphone (107) being 

comprised by said apparatus (101). 

4. A system as claimed in any of claims 1 to 3, said system comprising input 
designation means (209,215) for user operably designating said microphone (104) and/or said 

15 further microphone (1 70) as a signal source to said speech processor (211). 

5. A system as claimed in claim 4, said input designation means (209,21 5) being 
capable of controlling the speech processor (21 1) to process speech input from said 
microphone (104) only, 

20 

6. A system as claimed in claim 4, said input designation means (209,21 5) being 
capable of controlling the speech processor (21 1) to process speech input from said further 
microphone (107) only. 

25 7. A system as claimed in claim 4, said input designation means (209,21 5) being 

capable of controlling the speech processor (21 1) to process speech input from both said 
microphone (104) and said further microphone (107), said microphone (104) having priority 
over said further microphone (107). 



8 27.07.2000 

8. A remote control (102) for use in a system as claimed in any of claims 1 to 7, 
comprising a microphone (104) for enabling a user of said remote control to input speech 
commands for processing by a speech processor (21 1), said remote control comprising input 
designation means for user operably designating said microphone (104) and/or a further 
microphone (107) as a signal source to said speech processor. 

9. An apparatus (101) for use in a system as claimed in any of claims 1 to 7, 
comprising a speech processor (21 1) for processing speech commands, and input designation 
means (209,21 5)*for user operably designating a microphone (104) comprised by a remote 
control (102) of said apparatus and/or a further microphone (107) comprised by said 
apparatus as a signal source to said speech processor. 

10. A method of controlling an apparatus (101), comprising a step of processing 
speech commands for controlling said apparatus, and a step of user operably designating a 
microphone (104) comprised by a remote control of said apparatus and/or a further 
microphone (107) comprised by said apparatus as a signal source to said speech processing. 
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ABSTRACT: 



The invention relates to a system comprising a speech processor for 

controlling an apparatus (101) with speech commands. The system according to the invention 

comprises a remote control (102) having a microphone (104) for entering speech commands. 

The system also comprises a further microphone (107) for enabling other users of the system 

5 to enter speech commands too. The system may comprise input designation means (105) for 

user operably designating said microphone (104) and/or said further microphone (170) as a 

signal source to the speech processor. 
&« 

Fig. 1 
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FIG. 2 
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